A Novel Genetic Algorithm Based Method for Building Accurate and Comprehensible Churn Prediction Models
نویسندگان
چکیده
Customer churn has become a critical problem for all companies in particular for those that are operating in service-based industries such as telecommunication industry. Data mining techniques have been used for constructing churn prediction models. Past research in churn prediction context have mainly focused on the accuracy aspect of the constructed churn models. However, in addition to the accuracy, comprehensibility aspect should be considered in evaluating a churn prediction model. Being comprehensible, a model can reveal the main reasons for customer churn; thereby mangers can use such information for effective decisions making about marketing actions. In this paper, we demonstrate the application of a genetic-algorithm (GA) method for building accurate and comprehensible churn prediction model. The proposed method, GA-based method uses a wrapper based feature selection approach for choosing the best feature subset. The key advantage of this method, is taking into account the comprehensibility measure (measured as the number of rules extracted from C4.5 decision tree) in evaluating the performance of a candidate model. The GA-based method is compared to the two filter feature selection methods including Chi-squared based and Correlation based feature selection using two telecommunication churn datasets. The results of experiments indicated that the GA-based method performs better than the two filter methods in terms of both accuracy and comprehensibility Article history : Received: September 9, 2013 Revised: December 10, 2013 Accepted: December 28, 2013
منابع مشابه
Building comprehensible customer churn prediction models with advanced rule induction techniques
Customer churn prediction models aim to detect customers with a high propensity to attrite. Predictive accuracy, comprehensibility, and justifiability are three key aspects of a churn prediction model. An accurate model permits to correctly target future churners in a retention marketing campaign, while a comprehensible and intuitive rule-set allows to identify the main drivers for customers to...
متن کاملA Neuro-Fuzzy Classifier for Customer Churn Prediction
Churn prediction is a useful tool to predict customer at churn risk. By accurate prediction of churners and non-churners, a company can use the limited marketing resource efficiently to target the churner customers in a retention marketing campaign. Accuracy is not the only important aspect in evaluating a churn prediction models. Churn prediction models should be both accurate and comprehensib...
متن کاملDetermination of Algorithms Making Balance Between Accuracy and Comprehensibility in Churn Prediction Setting
Predictive modeling is a useful tool for identifying customers who are at risk of churn. An appropriate churn prediction model should be both accurate and comprehensible. However, reviewing the past researches in this context shows that much attention is paid to accuracy of churn prediction models than comprehensibility of them. This paper compares three different rule induction techniques from...
متن کاملApplication of Genetic Algorithm Based Support Vector Machine Model in Second Virial Coefficient Prediction of Pure Compounds
In this work, a Genetic Algorithm boosted Least Square Support Vector Machine model by a set of linear equations instead of a quadratic program, which is improved version of Support Vector Machine model, was used for estimation of 98 pure compounds second virial coefficient. Compounds were classified to the different groups. Finest parameters were obtained by Genetic Algorithm method ...
متن کاملPrediction of soil cation exchange capacity using support vector regression optimized by genetic algorithm and adaptive network-based fuzzy inference system
Soil cation exchange capacity (CEC) is a parameter that represents soil fertility. Being difficult to measure, pedotransfer functions (PTFs) can be routinely applied for prediction of CEC by soil physicochemical properties that can be easily measured. This study developed the support vector regression (SVR) combined with genetic algorithm (GA) together with the adaptive network-based fuzzy infe...
متن کامل